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Abstract 

Crucial to an Evolutionary Algorithm's perfor- 
mance is its selection scheme. We mathematically 
investigate the relation between polynomial rank 
and probabilistic tournament methods which are 
(respectively) generalisations of the popular linear 
ranking and tournament selection schemes. We 
show that every probabilistic tournament is equiv- 
alent to a unique polynomial rank scheme. In fact, 
we derived explicit operators for translating be- 
tween these two types of selection. Of particular 
importance is that most linear and most practical 
quadratic rank schemes are probabilistic tourna- 
ments. 

1 Introduction 

Evolutionary algorithms. Evolutionary Algorithms 
(EAs) are probabilistic search algorithms based on evo- 
lution [Gol89, ESQ3]. They operate by exploiting the in- 
formation contained in a population of possible solutions 
(via similarities between individuals). The aim is to find 
an individual that maximises (or minimises) an objective 
function, which maps from individuals to the real line. 
The population is transformed by first selecting individu- 
als. Mutation and/or recombination is then used to either 
replace a few individuals from the population or create an 
entirely new population. 

Standard selection methods. The most prevalent 
methods for selecting individuals are proportionate, lin- 
ear rank, tournament, and truncation [HL06]. In propor- 
tionate selection individuals are chosen with a probability 
proportional to their fitness (the value of the objective 
function evaluated at the individual) [Bac94]. A common 
method to gain more control over selection pressure, is to 
scale the fitness values before the selection is made [Bac94 . 
Linear ranking proceeds by ordering the population ac- 
cording to their fitness. The chance that an individual 
is selected is then a linear function of its (unique) rank 
Bac94|. Tournament selection creates a tournament by 
randomly choosing t individuals, the best individual in the 



tournament is then selected. For truncation selection the 
k fittest individuals have uniform probability of selection, 
while the remainder have zero chance of being selected. 

The choice of selection scheme is crucial to algorithm 
performance. If the selection pressure is too high then 
diversity of the population decreases rapidly and the al- 
gorithm converges prematurely to local optima or worse. 
With too little pressure there is not enough push toward 
better individuals and the population takes too long to 
converge. Many methods to choose or adapt the selec- 
tion pressure or avoid the problem otherwise have been 
invented (see |HL06j for some references). A particularly 
simple one is fitness uniform selection, which uniformly 
selects a fitness value, and then the individual with fitness 
closest to this value. 

It is quite profitable to study selection schemes due to 
their generality. They depend only on the set of fitness 
values and not on the rest of the algorithm. Hence their 
behaviour can be studied in isolation and the results ap- 
plied to any evolutionary algorithm. In this paper we in- 
troduce and study generalizations of rank and tournament 
selection (both actually only depend on the rank and not 
the absolute fitness value itself). 

Polynomial rank selection. Linear ranking has a small 
range of selection pressures (from [Bac94] . for a popula- 
tion of n individuals the probability that the fittest indi- 
vidual is selected must be between 1/n and 2/n), but it 
has the flexibility of a real- valued parameter that can vary 
continuously (the slope of the linear function). Ranking 
schemes with high selection pressures, such as when the 
probability of selection is an exponential function of the 
rank, have occasionally been used |WC02| . It is natural 
then to generalise from linear to polynomial functions to 
cover the instances where medium pressure is required. 
Hence the probability of an individual with rank k being 
selected with a polynomial rank scheme of degree d is: 

d+l 

P(I = k)=J2aik 1 - 1 (1) 

where a; £ J? are parameters defined by the algorithm de- 
signer. For simplicity we assume that selection is per- 
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formed with replacement and each individual has unique 
rank, however our results still hold when there are ties in 
the rank. The only restriction on the is that they must 
produce a proper probability distribution, i.e. for a popu- 
lation of n individuals: P(I = k)>0 for all fc=l,2,...,n and 
Sfc=i-F , (^ = ^) = l- Hence, while the population is ordered, 
the schemes may favour low ranks, high ranks or neither, 
depending on the choice of the (a;). 

This selection method encompasses the low pressures of 
linear schemes (03 = ... = a^+i = 0) and can give good ap- 
proximations of the high pressure exponential cases (via 
Taylor polynomials). Furthermore the wealth of general 
knowledge about polynomials means that while it has nu- 
merous parameters (coefficients of the monomials), it is 
also easy to predict their impact. 

Probabilistic tournament selection. Tournament se- 
lection has a large range |Bac94| . but a discrete parame- 
ter, leaving the possible selection pressures somewhat re- 
stricted. This can be overcome by selecting probabilisti- 
cally from the tournament, rather than always choosing 
the best in the tournament. However the extra parame- 
ters required are not easy to understand. Their precise 
effect on the behaviour is not at all obvious. Probabilis- 
tic Tournament selection still only sorts t<S^n individuals, 
making it much faster than any ranking scheme. 

Let i s be the (rank of the) individual in position s € 
{l,...,t} of the rank-ordered tournament. We call s the 
seed of i. Let P(I s = k) be the probability that seed s has 
rank k. In any given tournament, the probability that the 
seed s individual is chosen will be a user defined constant 
a s . Then the probability of an individual k being selected 
through a size t probabilistic tournament is: 

t 

P(I = k) = J2a s P(I s = k) (2) 

s=l 

Standard (deterministic) tournament always selects the in- 
dividual of highest rank in the tournament, i.e. ot\ = 1 and 
Q' 2 = .. = a t = 0. 

To ensure that choosing a winner from the tournament 
makes sense, the a s must satisfy the probability con- 
straints a s > OVs and X^=i a s = 1- We assume that the 
tournament is created by random selection with replace- 
ment and for now that each individual in the population 
has a unique fitness. This defines P(I S — k) (Section [2]). 
Note that even if every individual in the population is 
unique, it is possible for it to be repeated in the tourna- 
ment. 

Previous work on the relation between rank and 
tournament selection. In this paper we investigate the 
equivalence between the generalised schemes (P) and ([2]) 
with the aim of providing a scheme that combines the 
superior understanding of polynomial rank with the speed 
of probabilistic tournament. 

Back [Bac94 found that an individual's chance of se- 
lection in deterministic tournament selection is a polyno- 



mial, hence each is equivalent to a polynomial rank selec- 
tion method. Wieczorek and Czech [WC02j . and Blickle 
[BT95] arrived at the same conclusion using a different 
method. So while the name 'polynomial rank selection' is 
new, its concept is fairly old. 

The study of probabilistic tournaments isn't new either: 
Hutter |Hut91l p. 11] proved that every size 2 probabilistic 
tournament is a linear rank scheme, and Goldberg [GD91 
did the same but only for a continuous population. Fogel 
Fog88 applied to the traveling salesman problem, a vari- 
ation wherein each individual underwent numerous t=2 
tournaments. The probability of winning each tournament 
was dependent on the fitness of the individuals involved 
and the individuals selected were those with the highest 
number of wins. 

Contents: Equivalence of polynomial rank and 
probabilistic tournament selection. We extend these 
results by finding that every t sized probabilistic tourna- 
ment is equivalent to a polynomial rank scheme with a 
polynomial degree of d = t— 1 or less (Section [2]) . We con- 
tinue on to show that the equivalence is unique (Section 
and give an explicit expression for the inverse map 
(Section [4}. This allows the establishment of simple cri- 
teria for polynomial rank schemes that are probabilistic 
tournaments (Section [5| . Unfortunately not every possi- 
ble polynomial rank scheme satisfies the criteria, but most 
(and in the limit of an infinite population, all) linear and 
most "interesting" quadratic ones are equivalent to proba- 
bilistic tournaments. This is good enough for all practical 
purposes, if it generalises to higher order polynomials. 

Notation. Throughout the paper we use the following 
notation. If not otherwise indicated, an index has the full 
range as defined in this table. 



Symbol Explanation 

Sij Kronecker symbol 

(dij = 1 for i — j and 5^ — for i^j) 

n Number of individuals in the population 

i,j,k Rank (unique label) of individuals g{l,...,n} 

l,k Rank indices that only run from l,...,t 

p,q,r,s Seed index G{1,...,<} 

/ s Rank of the individual with seed s 

I Rank of the individual selected 

Tti = P(I = i) Probability that i is selected 

I Polynomial coefficients index £{l,...,d+l} 

ai Coefficients of a;' -1 for the polynomial 

0:1,012,..., at Tournament selection coefficients 

x l ,x,(x i ) Vector x=(xi) = (xi,...,x t ) 

A m m — 1 dimensional probability simplex 



2 Probability of Selection via a 
Tournament 

In this section wc find the probability of an individual be- 
ing successful (the winner) via tournament selection. This 
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will provide a formula for an equivalent ranking selection 
scheme. It is sufficient to consider just one selection event 
in isolation, since we consider selection with replacement. 

We assume a population V consisting of n individuals 
ci,...,c„ with fitness fi,...,f n - Without loss of generality 
we assume that they are ordered, i.e. f(i) > f(j) for all 
j < i. For now we also assume that all fitness values are 
different, hence individual c, has rank i. The rank is all 
we need in the following, and we will say "individual i" , 
meaning "individual Cj" . 

Definition 1 (polynomial rank selection) 

Polynomial a-ranking selects individual Ck from pop- 
ulation V with probability P{I = k) — Yl'i=i a ik 11 

Definition 2 (probabilistic tournament selection) 

A probabilistic a -tournament selects t individuals from 
population V uniformly at random with replacement. 
Let ci g be the individual of rank s in the tournament, 
called seed s (while it has rank I s in the population). 
Finally the seeds individual, I s , is chosen with probability 
a s = P{I = I s ) as the winner I. 

Theorem 3 (tournament=polynomial) Probabilistic 
a-tournament selection coincides with polynomial a- 
ranking (for d=t—l and suitable a). 

Proof. We derive an explicit expression for the probabil- 
ity 7Tfc that the tournament winner has rank k. Any seed s 
may have rank k (I s — k) and may be the winner (J= J s ), 
hence 

t 

TTfc = P(I = k) = ^P(/ = J S )P(/ S =fc) = © 
3 = 1 

where we have exploited that by definition the probability 
that I = I s is independent of the rank I s = k. P(I S = k) 
is the probability that seed s has rank k. It is difficult 
to formally derive an expression for P(I S = k), but we can 
easily get it by considering distribution functions. The 
probability of an individual selected into the tournament 
having a particular rank is 1/n, hence having rank equal to 
or less than k is k/n and larger than k is 1 — k/n. Further, 
I r < k/\I r+ \ > k if and only if r seeds have rank < k and 
t—r seeds have rank >fc, hence 

P(I r <kAl r+1 >k) = (l)(ky {1 -ky-r 

since there are (*) ways of choosing r individuals with 
rank < k from t individuals. The above expression is a 
polynomial in k of degree t. Together with 
t 

P(I S <k) = P^r < k A I r+1 > k), 

r—s 

we get the explicit expression 

P(I S = k) = P(J S <k) - P(I S <k-l) (3) 



Using the binomial theorem to find the fc* and fc* -1 
coefficients in the square brackets above reveals that the 
former coefficients cancel out while the latter do not. This 
implies that P(I S =k) is a polynomial in k of degree (at 
most) t—1, and thus the weighted average @ is as well. 
Summing © over the population yields 5Z£ =1 P '{I = k) = 1, 
as it should, since the tournament coefficients are such 
that some individual is always chosen. Consequently, 
every tournament is a polynomial rank scheme of degree 
at most t — 1 (one can choose a such that it is of lower 
degree) . □ 

Examples. Expression ([3]) can be rewritten as 

P(I s = k) = gQ[(fcl)r(l-4=l)*--(i)r(l_i)*-r] 

which will be convenient in the following examples. Stan- 
dard tournament always selects I\ (ai — 1), hence [Bac94 

P(I = k) = P(h=k) = (i-*=l)*_(i-i)* 

See Figure Q] For t—1 there is no selection pressure, 
P(7 = fc) = I. For t=2 we get 

P(j x = fc) = 2a=afc±i and P{h = k) = 2k-i 

Hence probabilistic tournaments of size 2 lead to linear 
ranking Hut91j 

P(I=k) = a l P{I l =k) + a 2 P{I 2 = k) = ai +a 2 k, 
<zi = 4 r [(2n + l)ai -a 2 ], a 2 = ^{a% - aei) (4) 

Remark. More interesting is actually the converse, 
replacing rank selections by equivalent efficient tourna- 
ments. Before we can answer this, we need to break down 
(|3|) into a product of simple regular matrices. 

3 The Map from Tournament to 
Polynomial is Unique 

The next natural question is whether different tournament 
bias a implies different selection probability. It seems 
plausible that the maps from tournaments a to rank prob- 
abilities 7T and to polynomial coefficients a are injective, 
but the proof is fairly involved. The good news is that 
construction in the proof allows us to find a closed form 
expression for the desired inverse. Let A m = {x 6 M m : 
Xi>OWi,Y^iLi x i — ^} be the m— 1 dimensional probability 
simplex, i.e. 7rGA„ and a£A f . 

Theorem 4 (tournament^polynomial) The func- 
tion i?:A t ^A„ in mapping tournament probabilities 
a to rank probabilities tt, is total, linear, and injective: 

t 

7Tfe = P(J = k) = Rk a s, i-e. 7r = Ra, 

s=l 
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Figure 1: [tournament probabilities for large n] Probability 
density nP(Ii=xn) that the tournament winner has rank 
xn, for tournament size 4 = 1,2,3,5,8. 

where R s k = P(I s = k) is defined in |[3J). Matrix R can also 
be written as aproduct R = DPFCD = VNFCD = VT 
with matrices D, P , F , C , D, V , and N defined in ([?]), 
(0), {Z2P> & , (W> and GM>- Similarly, the function 
T : At— > , mapping a to polynomial coefficients a, is 
unique, linear and infective: 



l "sj 



i.e. 



Ta, 



where matrix T = NFCD. 

Proof. Tournament always selects one individual from V 
as the winner, hence Rot £ A„ for every a 6 At. See the 
proof of Theorem [3] for how to prove this formally. 

Matrices H and G. We now prove injectivity. With 
Hl:={t) r {l-tf-r and Gl-^QiHl-H^) 
we can write ([3]) as 



R% ee P(I S = k) = 



(5) 



Einstein notation. Einstein's sum convention will be 
convenient in the following argument: When an index oc- 
curs repeatedly in the multiplication of two objects, a sum 
over the index over its full range is implicitly understood, 
e.g. G r k D s r means Y? r =i G k D r- 

Lower-triangular matrix D. The lower-triangular ma- 
trix 

D s ={ 1 if S - r (6) 
■ s if s > r [ ' 



has the property that ELi^^-EL^fc' Using Ein- 
stein's sum convention this allows us to rewrite §5§ as 

i.e. as a product of an nxt matrix G with a t x t matrix 
D. 



Inverse of D. The "inverse" of D is: 

= S k .i — 5k-i,i 



D k := 



1 if k = i 
-1 ifi = fc-l 
otherwise 



(7) 



This is a matrix with 1 on the primary diagonal; —1 on 
the diagonal that is below the primary diagonal; and 
otherwise. 

Decomposing G. G r k itself can actually be decomposed 
into D k and and a pure diagonal matrix 

comprised of the binomial coefficients: 



(8) 



Gl = (H*-Hl_ 1 )Cr q = D k HfC r q 

(note that D is the inverse of an n x n sized D matrix here) . 

Decomposing H into P and F. We can decompose 
Hf further be using the binomial identity: 



H? 



So H; = Pf F£, where P is a matrix of monomials: 



p p — (±\p 

1 i ■ \ n J ' 



(9) 



and F is a lower-triangular matrix composed of various 
binomials: 



otherwise 



(10) 



Matrices TV, V, and R. Putting everything together 
we have 

Rl = ^ k PfF^ p C r q D s r 
The (linear) map R k is a polynomial in k of degree (at 



most) t—1. We can find its coefficients by rewriting 

n-n-x = dr-c^r (h) 



1=1 



where 



VI := 



Nf 



k l 1 , and 

otherwise 



(12) 
(13) 
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Hence we get the alternative representation 
7T fc = R s k a s = V k l NfF«C r q D s r a s 



(14) 



Injective. Matrices D, D, and F are lower-triangular 
matrices with 1 in the diagonal, and hence are invertible 
(thus injective). C is diagonal and N upper triangular, 
both nowhere zero on the diagonal, hence invertible too. 
The first t rows of V map from a set oft coefficients b to the 
polynomial p(x) = S*=i^ x ' -1 evaluated at x = 1,2, 3, 
A degree t—1 polynomial like p is uniquely determined 
by t image points (see Appendix), hence V is injective. 
Similarly for P or exploit P\ = kV£(~) (no summation). 
This proves that R is injective. 

Matrix T. Combining the map from a to it 



TTjfc 



P(I = fc) = E aik 1 - 1 = V l k ai i.e. tt = Va, 



i=i 



with ai=Tfa s we get 

n k = V£7?a a 

Comparing this with ([14"|) and using injectivity of V we 
see that 

Tf = N?F«C r q D s r (15) 

which is injective, since N, F, C, and D are invertible. 

□ 



Discussion. Given a Polynomial Rank scheme it is pos- 
sible and easy (using computer software) to find if it is 
equivalent to a probabilistic tournament (and get the cor- 
responding parameters) by applying the inverse of T to a. 
If the output satisfies the probability requirements a£A ( , 
then it is indeed a probabilistic tournament. 

4 Map from Polynomial Ranking to 
Tournaments 

We now derive explicit expressions for the really interest- 
ing converse of map T, which allows replacement of ineffi- 
cient rank selections by equivalent efficient tournaments. 
From the last section we know that the inverse exists. 

Theorem 5 (polynomial^tournament) The func- 
tion T : IR 1 — > M* , mapping polynomial coefficients a to 
tournament parameters a is linear 



i=i 



i.e. 



Ta 



where matrixT=T~ 1 = D C F N , withD, C, F, N defined 
in ([?]), \16\) . |i7| ], (18\) . a-polynomial ranking can be im- 
plemented as an a-tournament if and only if a = Ta£A t . 



Inverse matrices. In the following P and V respectively 
denote the upper txt submatrix of P and V. The inverse 
matrices are as follows 



C Q r := <W(*) 

t-r- 
t—q 



F r q := (I'D if r < q and else 



N p 



(16) 
(17) 

(18) 

n l Vi^ (no summation) (19) 

The inverse of the diagonal matrix C is obvious. The 
expression for P immediately follows from P". = kV^(—) 
(no summation). 

F£F q = for r>p (since then either r>q or q>p) and 
for r < p we have 

The first equality is by definition, the second equal- 
ity is a simple reshuffling of factorials, and the last 
equality follows from the well-known binomial identity 
E™o(-) l (T) = for to > 1. This proves that F is the 
inverse of F. 

Unfortunately we were not able to invert iV directly, 
although N seems similar to (the transpose of) F. So we 
used relation (fTT) to invert N in (fT8|) . But now we need 
the inverse of P, which can be reduced by (I19|) to the 
inverse of V. 

Inverse of V . The most difficult matrix to invert is V. 
This special Vandermonde matrix V can be written as 
a product of a lower L and upper-triangular matrix U, 
whose inverses are |Tur66j : 



t: := 



(s -«)!(«-!)! 



for s > k and else 



U i := Sy — Stirling numbers of the first kind 



The Stirling numbers Ss numbers are defined as the co- 

:(x— l)...(x— s+1), i.e. 

and S[ l) = for I > s 



efficients of the polynomial x(x— l)...(x— s+1), i.e. by 



E S ° )xl - ( X -8)\ 
1=0 v ' 



There are many ways to compute Sb , e.g. recursively by 



S% 1 =S¥ L, ~sSr or directly [AS741 p.824]. For r > k 
we get 



VlV 



i=i 



s { s l) (-y- K 

(s - k)\(k - 1)! 



S — K, 



(s - k)\(k - 1)! 



[(r-l)...(r-s + l)}(-r- 
(s-k)\(k-1)\ 

a:DE(;::)H s - K = 5 *-r 



E 

S — K 



■5 



The case r<n is similar. This shows that V is the inverse 
of (the first t rows of) V. 

Linear ranking example. For t = d+l = 2 we can com- 
pute the matrices by hand. This list of (reduced) matrices 
is a useful sanity check for the reader's own implementa- 
tion: 



F = 






H = 


1 f n-1 2n-4 3n- 
n^V 14 9 


-9 ■■■ \ 
•■■ nV 


C = 


a, c- 


= if' ) 

2 V0 2/' 


G 


1 / n— 1 n— 3 "■ 
_ ?l 1 3 ■■■ 


-1+n \T 
2n-l) 


N = 


1 (n-l\ 
i?\0 2/> 




N 


= 2f a ^ 
2 VOn/ 




P = 


1 In 2n 3n •■ 
VI 4 9 ■■■ 


■ n 2 \T 
n 2 / ' 


P 


n f 4 -1\ 
— 2 V-2n n) 




V = 


m-INT 
\l2---n) ' 




V 


= L\-l) 




T = 


1 /2ri+l-l\ 
W \ -2 2 J 




T 


= nl2 IN 
4 V2 2n+1/ 




U = 


Co"! 1 ). 1 


= L\\) 


, R 


1 (2n~12n~3 ■ 
~ 7P\ 1 3 ■■ 


" 1 y 

■ 2n-l 7 



We see that -K = Ra and a = Ta coincide with ([4}, as they 
should. 

Computational complexity. Together this allows us to 
compute a from a and vice versa in time 0(t 2 ) and 7r from 
a in time 0(nt). Once a is known, tournament selection 
needs only time 0(t) per winner selection. 

5 What Polynomial Selection Schemes 
are Tournaments? 

Theorem[5]does not give us conditions under which the re- 
sulting tournament parameters ot = Ta are valid. We look 
for such conditions so that we can reliably change/create 
tournament schemes in the more understandable set of 
polynomial rank schemes. Without these conditions there 
can be no guarantee that whatever created would be a 
probabilistic tournament. 

Range of linear ranking. Let us first consider the case 
of linear ranking (c?=l), 

P(I = k) = ai + a 2 k 

We want to find the range of a\ and a 2 for which this is a 
proper probability distribution in A„ . The sum-constraint 
leads to 

1 = V" P(I = k) = ain + a 2 \n(n + 1) 
fc=i 

=> ai = 7^-\^ 2 + n)] (20) 

Next are the positivity constraints P(I — k) > OVfc. A 
linear function is > if and only if it is > at its ends, 
i.e. P(7=l)>0 and P(I=n)>0. Inserting {20]) into these 
constraints yields: 

P(I=1) = a,+a 2 >0 a 2 < 

P(I = n) = ai+a 2 n>Q a 2 > 

So the possible linear rank schemes are those with 

1^2 1 < ^rz^ and oi satisfying @Q (21) 



Range of tournament size 2. The example Q shows 
that size t = 2 probabilistic ct-tournamcnts have a 2 = 
2{a 2 —a\)/n 2 . Since a£ A 2 , a 2 has range As 
it should be, this is a subset of the possible linear rank 
schemes. Hence the linear rankings that are probabilistic 
tournaments are those with 

M < £ and ax given by ((HD (22) 

This is slightly narrower than \a 2 \ < 2 _ i i.e. there are 
some rankings that are not probabilistic tournaments. On 
the other hand, — / -4 tends to 1 as n grows, hence for 
n large (e.g. about 100) nearly all linear rankings can be 
translated into probabilistic tournaments. The coverage 
is good enough for all practical purposes. 

The general case. A probabilistic selection scheme is 
completely determined by 7r, different ir correspond to 
different selection schemes, and every ir G A„ is a valid 
selection scheme. Hence, A„ is the set of all possible 
probabilistic selection schemes. The set of (valid) size t 
tournament schemes is 

RA t := {tv = Ra. : a E A t } C A„ 

Since R is injective, this is a t — 1 dimensional irregular 
simplex embedded in the n—1 dimensional simplex A„ . 

The set of (inch invalid) degree (up to) t— 1 polynomial 
ranking schemes is 

VR f := {ir = Va : a e IR*} % A„ 

This is a i-dimensional hyperplane. Only 7r in A„ are 
valid, hence V^V\A n is the set of (valid) polynomial rank- 
ing schemes. The intersection of a simplex with a plane 
gives a closed, bounded, convex polytope, in our case of di- 
mension f-1. The Krein-Milman Theorem |Edw651 p. 707] 
says that for a closed, bounded, convex subset A of M* 
with a finite number of extreme points (^corners), A is 
the convex hull of the extreme points of A. Hence the ex- 
treme points of VFtfnAn completely characterize/define 
the set. 

If /since we are not concerned with the covering of 
VM t nA n in A„ itself, we can study the covering in the 
lower-dimensional polynomial coefficient space _ZR*. The 
set=polytope of all polynomial coefficients a that lead to 
valid selection probabilities is 

VA n := {a e R l : Va e A,,} 

while the set=simplex of coefficients reachable by tourna- 
ments is 

TA t := {a = Ta:aeA t } C VA n 

These sets are the images of VF^PiAn and the simplex 
A t under V and T respectively. These maps are injective 
(Section H]) so VA n and TA t are completely determined 
by their extreme points. The extreme points of A t are just 
the conventional M l basis vectors e s , so TA t is the convex 
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hull of {T(e a ):s = l,...,t}. The polytope ViR*nA n can be 
quite complex, and finding the extreme points daunting. 
This is essentially what we did for the t = 2 case in the 
above paragraphs. 

We estimated the proportion of degree t— 1 polynomials 
covered by TA t for various t using a Monte-Carlo algo- 
rithmic (Table [T]). It shows that for n> 100, practically all 
linear rank schemes are probabilistic tournaments. 

Nothing concrete can be concluded about the coverage 
for i = 4,5. Table [T] only suggests that the number of t — 
1 degree polynomials equivalent to i-sized tournaments 
decreases as t increases. 

Tournament size 3. In the t = 3 case we can extend our 
knowledge by finding VA n graphically. The restriction 
J^ =1 7Tfc = l means that the k 2 coefficient 03, is completely 
determined by a\ and a 2 . 

Q3 = , 2 (l - nai - \n(n + l)a 2 ) (23) 

Hence VA n is a 2 dimensional hyperplanc. P(I=k)>0 for 
each k defines a set of halfspaces: {(ai,a 2 ,a 3 ) : a\ + a%k+ 
»3fc 2 _0}; V A n is their intersection (over k) restricted to 
the plane given by (|23|) . 

TA3 is simply a filled triangle with corners {T(e s ):s — 
1,2,3}. Comparison with VA n (Figures [2 [3] and [4]) sug- 
gests that the coverage of TA3 is stable for n— >oo. Hence 
for large populations about a third of the quadratic poly- 
nomials can be written as size-3 probabilistic tournaments. 

In practice, selection schemes with probability monoton- 
ically increasing with fitness are used. So not the whole 
of VA n is interesting, but only the subset of monoton- 
ically increasing or possibly decreasing probabilities on 
{1,2,.. .,n} (light grey in figures [2] [3] and [4]) . The remain- 
der of VA n is composed of schemes that favour the middle 
ranks or both high and low ranked individuals (dark grey) . 

Any polynomial scheme P(I = k) = ai + a 2 k + 
p (l — rail — \n{n + l)a 2 )k 2 is a parabola^, so it is 
symmetric about it's stationary point, x s t. p t.- Hence 
P{I = k) is monotonic on {1,2,.. .,n} if and only if x st . p t. 
lies outside the interval (1+5,71— |). 

i.e. 

x = =^L (yA < 1 + i 

st - pL 1 - nai - \n{n + l)a 2 \^ J ~ 2 

OR 

x s t.pt. > n — - 

Figure H] suggests that these regions of usefulness effec- 
tively lie entirely in TA3 for n>300. Hence for sufficiently 
large n the most useful degree 2 polynomial schemes are 
perfectly reproduced by some probabilistic tournament. 

An example of a less applicable selection scheme is the 
polynomial given by a\ = 0.01 and a 2 = — 1 x 10~ 4 (which 

1 The t = 2 case was calculated directly from (12 1 [ i and (1221 
2 We temporarily consider k to range over the real line 





n = 4 


n = 10 


n = 20 


n = 100 


n = 300 


t = 2 


0.7500 


0.9000 


0.9500 


0.9900 


0.9967 


t = 3 


0.270 




0.348 


0.342 


0.332 


t = 4 




0.12 


0.15 


0.16 




t = 5 




0.02 









Table 1: fraction of possible t—1 degree polynomials that 
car 

1.5-1 




Figure 2: [n = 4,i = 3] The shaded region is the set of 
possible polynomials, whilst the light grey area is the set of 
the most useful polynomials. The triangle is the boundary 
of the set that can be written as i = 3 tournaments. Atp\: 
a 3 = -0.246. Atp 2 : a 3 = 0.159. At p 3 : a 3 = 0.236. At Pi : 
a 3 =0.023. 

lies in the dark grey region). It favours both high ranks 
and low ranks (Figure O and any algorithm using this 
scheme will spend half of the time searching in the wrong 
place. However it is still usable (like in fitness uniform 
selection [HL06] ). 

The points pi, p 2 ... are extreme points of V A n . They 
indicate that the range of 03 values is significantly smaller 
than the range of a 2 (which in turn has a smaller range 
than ai). 

V A n being the intersection of a finite number of halfs- 
paces and planes means its boundary is actually a series 
of straight lines. V A n appears curved in figures [3] and 2] 
simply due to the many halfspaces that are involved. 

6 Discussion/Conclusions 

Rank ties. Individuals with the same fitness lead to ties 
in the ranking. If we break ties (arbitrarily but consis- 
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Figure 5: [n=300] The polynomial y = 0.01 - 10~ 4 a; + 
2.781 x 10~ 7 x 2 . This is an example of a usable quadratic 
polynomial that is not equivalent to a probabilistic tour- 
nament. 



tcntly), our theorems still apply. The disadvantage is that 
the selection probability for two individuals with the same 
fitness may not be the same. We can fix this problem by 
breaking ties (uniformly) at random. For instance, given 
a population of 3 individuals with two of them having the 
same fitness, this results in effective selection probabilities 



Figure 3: [ra=20,i = 3] The shaded region is the set of *i =*i and n% 
possible polynomials, whilst the light grey area is the set of 
the most useful polynomials. The triangle is the boundary 
of the set that can be written as t — 3 tournaments. At 
Pi: a 3 = -8.56xl0~ 4 . At p 2 : a 3 = 1.08 x 10" 3 . At p 3 : 
a 3 = 1.91xl0- 4 . 
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Further work. Investigation of the set of possible poly- 
nomials with degree d > 3 will be helpful for those appli- 
cations requiring higher selective pressures. Furthermore, 
finding the proportion that are equivalent to probabilistic 
tournaments may provide a reliable method for making 
high-degree polynomial rank schemes more efficient. 

Tournaments of size t <C n are significantly faster than 
ranking schemes, so it would be beneficial to obtain a 
thorough understanding of how many polynomial rank 
schemes are equivalent to t > d+ 1 sized probabilistic tour- 
naments. 

Conclusion. We have found a strong connection between 
polynomial ranking and probabilistic tournament selec- 
tion. 



IP 2 



0.0 



0.005 



0.01 



0.015 



a i 



Figure 4: [n — 300, t — 3] The shaded region is the set of 
possible polynomials, whilst the light grey area is the set of 
the most useful polynomials. The triangle is the boundary 
of the set that can be written as t = 3 tournaments. At 
Pi: a 3 = -2.18 x 10~ 7 . At p 2 : a 3 = 3.53 x 10~ 7 . At p 3 : 
a 3 = 1.21xl0- 7 . 



We derived an explicit operator (|15| that maps any 
probabilistic tournament to its equivalent polynomial 
ranking scheme, which is unique and always exists. Poly- 
nomial rank schemes thus encompass linear ranking and 
deterministic (normal) tournament selection, leaving de- 
signers with one less selection method (but more parame- 
ters) to worry about. 

Unfortunately, turning polynomial rank schemes into 
equivalent probabilistic tournaments is not so straightfor- 
ward. Only about a third of the possible quadratic polyno- 
mials can be written as size-3 probabilistic tournaments. 

However, nearly all linear rank schemes have an equiva- 
lent size-2 probabilistic tournament. Hence nearly all can 
be made faster by simply rewriting the scheme as a prob- 
abilistic tournament. 

Furthermore, almost all the practical quadratic polyno- 
mials are equivalent to some t = 2 tournament. This is a 
good indication for the investigation of t>3. 
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A Appendix 

Uniqueness of a polynomial given t image points. 

Let 7r £ -R* be the vector of t image points ir K =p{x K ) 
for some xi,...,Xt of a polynomial p(x) =53*=i a ' a; ' _1 with 
coefficient vector a e M* . In particular we have 



1=1 



aiV l K , where V l K 



J-i 



If matrix V is invertible, the polynomial (coefficients) 
would be uniquely defined by a = V~ 1 ir, which is what 
we set out to prove. We now show that V is invertible. 
Define the t polynomials of degree t — 1 



p s (x) = JJ 



r=l 



Expanding the product in the numerator defines the coef- 
ficients A*. On x K we get 



S SK =Ps(Xk) 



1 = 1 



[VIA? 



hence A is the inverse of V. By explicitly expanding JIC^ - 
x r ) one can get an explicit expression for Af, which is 
unfortunately pretty useless. 
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